Search Result

Select

Forest pest detection method based on attention model and lightweight YOLOv4

Haiyan SUN, Yunbo CHEN, Dingwei FENG, Tong WANG, Xingquan CAI

Journal of Computer Applications 2022, 42 (11): 3580-3587. DOI: 10.11772/j.issn.1001-9081.2021122164

Abstract （343）

HTML （8）

PDF （4972KB）（126）

Save

Aiming at the problems of slow detection speed， low precision， missed detection and false detection of current forest pest detection methods， a forest pest detection method based on attention model and lightweight YOLOv4 was proposed. Firstly， a dataset was constructed and preprocessed by using geometric transformation， random color dithering and mosaic data augmentation techniques. Secondly， the backbone network of YOLOv4 was replaced with a lightweight network MobileNetV3， and the Convolutional Block Attention Module （CBAM） was added to the improved Path Aggregation Network （PANet） to build the improved lightweight YOLOv4 network. Thirdly， Focal Loss was introduced to optimize the loss function of the YOLOv4 network model. Finally， the preprocessed dataset was input into the improved network model， and the detection results containing pest species and location information were output. Experimental results show that all the improvements of the network contribute to the performance improvement of the model； compared with the original YOLOv4 model， the proposed model has faster detection speed and higher detection mean Average Precision （mAP）， and effectively solves the problem of missed detection and false detection. The proposed new model is superior to the existing mainstream network models and can meet the precision and speed requirements of real?time detection of forest pests.

Table and Figures | Reference | Related Articles | Metrics

Select

Violence detection in video based on temporal attention mechanism and EfficientNet

Xingquan CAI, Dingwei FENG, Tong WANG, Chen SUN, Haiyan SUN

Journal of Computer Applications 2022, 42 (11): 3564-3572. DOI: 10.11772/j.issn.1001-9081.2021122153

Abstract （402）

HTML （11）

PDF （2885KB）（121）

Save

Aiming at the problems of large model parameters， high computational complexity and low accuracy of traditional violence detection methods， a method of violence detection in video based on temporal attention mechanism and EfficientNet was proposed. Firstly， the foreground image obtained by preprocessing the dataset was input to the network model to extract the video features， meanwhile， the frame-level spatial features of violence were extracted by using the lightweight EfficientNet， and the global spatial-temporal features of the video sequence were further extracted by using the Convolutional Long Short-Term Memory （ConvLSTM） network. Then， combined with temporal attention mechanism， the video-level feature representations were obtained. Finally， the video-level feature representations were mapped to the classification space， and the Softmax classifier was used to classify the video violence and output the detection results， realizing the violence detection of video. Experimental results show that the proposed method can decrease the number of model parameters， reduce the computational complexity， increase the accuracy of violence detection and improve the comprehensive performance of the model with limited resources.

Table and Figures | Reference | Related Articles | Metrics

Select

Four-layer multiple kernel learning method based on random feature mapping

Yue YANG, Shitong WANG

Journal of Computer Applications 2022, 42 (1): 16-25. DOI: 10.11772/j.issn.1001-9081.2021010171

Abstract （378）

HTML （24）

PDF （665KB）（157）

Save

Since there is no perfect theoretical basis for the selection of kernel function in single kernel network models， and the network node size of Four-layer Neural Network based on Randomly Feature Mapping （FRFMNN） is excessively large， a Four-layer Multiple Kernel Neural Network based on Randomly Feature Mapping （MK-FRFMNN） algorithm was proposed. Firstly， the original input features were transformed into randomly mapped features by a specific random mapping algorithm. Then， multiple basic kernel matrices were generated through different random kernel mappings. Finally， the synthetic kernel matrix formed by basic kernel matrices was linked to the output layer through the output weights. Since the weights of random mapping of original features were randomly generated according to the random continuous sampling probability distribution randomly， without the need of updates of the weights， and the weights of the output layer were quickly solved by the ridge regression pseudo inverse algorithm， thus avoiding the time-consuming training process of the repeated iterations. Different random weight matrices were introduced into the basic kernel mapping of MK-FRFMNN. the generated synthetic kernel matrix was able to not only synthesize the advantages of various kernel functions， but also integrate the characteristics of various random distribution functions， to obtain better feature selection and expression effect in the new feature space. Theoretical and experimental analyses show that， compared with the single kernel models such as Broad Learning System （BLS） and FRMFNN， MK-FRMFNN model has the node size reduced by about 2/3 with stable classification performance； compared with mainstream multiple kernel models， MK-FRMFNN model can learn large sample datasets， and has better performance in classification.

Table and Figures | Reference | Related Articles | Metrics

Select

Open set fuzzy domain adaptation algorithm via progressive separation

Xiaolong LIU, Shitong WANG

Journal of Computer Applications 2021, 41 (11): 3127-3131. DOI: 10.11772/j.issn.1001-9081.2021010061

Abstract （439）

HTML （10）

PDF （743KB）（135）

Save

The aim of domain adaptation is to use the knowledge in a labeled （source） domain to improve the model classification performance of an unlabeled （target） domain， and this method has achieved good results. However， in the open realistic scenes， the target domain usually contains unknown classes that are not observed in the source domain， which is called open set domain adaptation problem. For such challenging scene setting， the traditional domain adaptation algorithm is powerless. Therefore， an open set fuzzy domain adaptation algorithm via progressive separation was proposed. Firstly， based on the open set fuzzy domain adaptation algorithm with membership degree introduced， the method of separating the known and unknown class samples in the target domain step by step was explored. Then， only known classes separated from the target domain were aligned with the source domain， so as to reduce the distribution difference between the two domains and perform the fuzzy adaptation. The negative transfer effect caused by the mismatch between unknown and known classes was reduced well by the proposed algorithm. Six domain transformation experimental results on Office dataset show that， the accuracy of the proposed algorithm has the significant improvement in image classification compared with the traditional domain adaptation algorithm， and verify that the proposed algorithm can gradually enhance the accuracy and robustness of the domain adaptation classification model.

Table and Figures | Reference | Related Articles | Metrics

Select

Particle filter tracking algorithm based on adaptive subspace learning

WU Tong WANG Ling HE Fan

Journal of Computer Applications 2014, 34 (12): 3526-3530.

Abstract （213）

PDF （805KB）（610）

Save

In order to improve the robustness of visual tracking algorithm when the target appearance changes rapidly, a particle filter tracking algorithm based on adaptive subspace learning was presented in this paper. In the particle filter framework, this paper established a state decision mechanism, chose the appropriate learning method by combining the verdict and the characteristics of the Principal Component Analysis (PCA) subspace and orthogonal subspace. It not only can accurately, stably learn target in low dimensional subspace, but also can quickly learn the change trend of the target appearance. For the occlusion problem, robust estimation techniques were added to avoid the impact of the target state estimation. The experimental results show that the algorithm has strong robustness in the case of illumination change, posture change, and occlusion.

Reference | Related Articles | Metrics

Select

Multi-plane detection algorithm of point clouds based on volume density change rate

CHU Jun WU Tong WANG Lu

Journal of Computer Applications 2013, 33 (05): 1411-1419. DOI: 10.3724/SP.J.1087.2013.01411

Abstract （746）

PDF （951KB）（599）

Save

Most existing methods for detecting plane in point cloud cost long operation time, and the result of detection is susceptible to noise. To address these problems, this paper put forward a kind of multi-plane detection algorithm based on geometric statistical characteristics of the point clouds. The proposed method coarsely segmented point clouds according to the change rate of the volume density firstly, then used the Multi-RANSAC to fit planes, at last the authors proposed a new merge-constraint condition to combine and optimize the initial fitted planes. The experimental results show that the method in this paper is easy to realize, can effectively reduce the influence of cumulative noise to the detection results, improve the plane detection accuracy and also greatly reduce the computing time.

Reference | Related Articles | Metrics

Select

Maximum a posteriori classification method based on kernel method under t distribution

Ru-yan ZHANG Shi-tong WANG Yao XU

Journal of Computer Applications 2011, 31 (04): 1079-1083. DOI: 10.3724/SP.J.1087.2011.01079

Abstract （1552）

PDF （738KB）（411）

Save

In order to solve the problem of multivariate normal distribution failing to comply with the distribution of sample data when it has a serious tailing phenomenon, a multiclass classification method under t distribution was proposed. Sample data were extended to the high dimensional feature space by kernel method and Maximum A Posteriori (MAP) was obtained by Bayesian classifier, and then classification result could be gotten. Because it has another degree of freedom parameter v in multivariate t distribution, it can more easily capture the complexity of the sample data, and enjoys better robustness. A large number of experiments have been done on the five international standard UCI data sets and three facial image data sets, and the experimental results show that the method achieves better classification and is feasible.

Related Articles | Metrics

Select

Application of WMD Gaussian kernel in spectral partitioning

Shi-tong WANG

Journal of Computer Applications

Abstract （1718）

PDF （655KB）（2248）

Save

To obtain a better segmentation result, this paper used Weighted Mahalanobis Distance (WMD) Gaussian kernel for Nystrm-Ncut segmentation. It proves that weighted Mahalanobis distance Gaussian kernel is more appropriate for spectral graph theoretic methods than Mahalanobis distance, because weighted Mahalanobis distance can compute the similarity between two pixels more accurately.

Related Articles | Metrics

Select

Affinity propagation clustering for symbolic interval data based on mutual distances

Xin-xi XIE Shi-tong WANG

Journal of Computer Applications

Abstract （1678）

PDF （628KB）（1068）

Save

Clustering for symbolic data is an important extension of conventional clustering, and interval representation for symbolic data is often used. The symmetrical measures in conventional clustering algorithms are sometimes not fit to interval data and the initialization is another severe problem that can affect the clustering algorithms. One metric called mutual distances for interval data was proposed; based on the metric, a new clustering method named affinity propagation clustering that could solve the problem initialization was used. Then, affinity propagation clustering for symbolic interval data based on mutual distance was given. Theoretical explanation and experiments indicate that the proposed algorithm outperforms K-means based on Euclidean distances for the interval symbolic data.

Related Articles | Metrics

Select

Improved order-invariant watershed algorithm for image segmentation

Shi-Tong WANG

Journal of Computer Applications

Abstract （2082）

PDF （876KB）（951）

Save

Creative lake minimum was introduced to represent topographical information of a pixel, so as to reduce the number of RIDGE labels in orderinvariant watershed algorithm. Watershed fall was used to reduce the number of over-segmented regions. This algorithm was implemented in both immersion and toboggan ways. Experiment results show that the proposed algorithm is still order-invariant, and it can reduce the number of RIDGE labels by about 80% as well as segmented regions by 5%~20%.

Related Articles | Metrics

Select

Web document representation method based on new-word discovery

Chun-Ying WU Shi-Tong WANG Chong-Chao CAI

Journal of Computer Applications

Abstract （1735）

PDF （783KB）（1231）

Save

Web document representation is important foundation of Web page feature selection and categorization. At present, the commonly used method is word-based vector space model. Because the model does not consider the latent concept construction of text, and a Web document is in semi-structural form where many new-words perhaps occur, a new Web document representation method based on new-word discovery was presented in this paper. This method first preprocessed the Web page, and segmented words for the converted documents; then tried to discover new-words by using bi-gram and mutual information, added new-words into its original dictionary, and segmented words for text again, finally represented the Web document using words and new-words. The experimental results show that it can help us to identify unknown words and extend the current dictionary, strengthen the representation of Web documents, improve the quality of the adopted vector, and increase the effect of Web document categorization.

Related Articles | Metrics

Select

Morphological color image filter based on fuzzy combination and soft-multi-structural elements

Shi-Tong WANG

Journal of Computer Applications

Abstract （1618）

Save

A new filter on color morphology based on fuzzy combination and soft-multi-structural elements was presented. Color image pixels were sorted by the value of fuzzy combination. Morphological filter based on the ranking of fuzzy combination can get better results than based on the ranking of HSV. Experimental results demonstrate that the ability of color processing based on the new morphology is superior to the classical one, and that the algorithm possesses excellent performance on the noise restrained.

Related Articles | Metrics

Select

Classification text with incomplete data based on Bernoulli mixture mode

CAI Chong-Chao CAI WANG Shi-Tong WANG

Journal of Computer Applications

Abstract （2019）

PDF （715KB）（868）

Save

It is an important issue to construct the text classification with incomplete data. An improved method that based on Bernoulli Mixture Model and Expectation Maximization(EM) algorithm was introduced. Based on Bernoulli Mixture Model and EM algorithm, by learning the labeled data, the initial value of likelihood function parameter was obtained first. Then the parameter estimate of prior probability model on the classifier with EM algorithm including weight was presented. Finally we got the improved classifier. The results show that our new method is better than the nave bayes text classification in the recall and precision.

Related Articles | Metrics